Identifying Correction Rules for Auto Editing

Authors

  • Anta Huang
  • Tsung-Ting Kuo
  • Ying-Chun Lai
  • Shou-De Lin
Abstract

This paper describes a framework that extracts effective correction rules from a sentence-aligned corpus and shows a practical application: auto-editing using the discovered rules. The framework uses the Levenshtein distance between sentences to identify the key parts of the rules, then uses the editing corpus to filter, condense, and refine them. The rule candidates take the form A => B, where A is the erroneous pattern and B is the correct pattern. The framework is language independent and can therefore be applied to other languages easily. In the evaluation, experts annotated 67.2% of the top 1500 ranked rules as correct or mostly correct. Based on the rules, we built an online auto-editing demo system at http://mslab.csie.ntu.edu.tw/~kw/new_demo.html.
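The abstract gives no implementation details, so the following is only a minimal sketch of how a token-level Levenshtein alignment between an erroneous sentence and its corrected version could yield A => B rule candidates. The function names `edit_ops` and `candidate_rules`, the whitespace tokenization, and the `context` parameter are assumptions made for illustration. They are not the authors' method, and the filtering, condensing, and refining steps are not shown.

```python
from typing import List, Tuple


def edit_ops(src: List[str], tgt: List[str]) -> List[Tuple[str, int, int]]:
    """Token-level Levenshtein alignment.

    Returns a list of (kind, src_index, tgt_index) operations, where kind is
    one of "keep", "sub", "del", "ins".
    """
    n, m = len(src), len(tgt)
    # dist[i][j] = edit distance between src[:i] and tgt[:j]
    dist = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        dist[i][0] = i
    for j in range(m + 1):
        dist[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = 0 if src[i - 1] == tgt[j - 1] else 1
            dist[i][j] = min(
                dist[i - 1][j] + 1,         # delete a source token
                dist[i][j - 1] + 1,         # insert a target token
                dist[i - 1][j - 1] + cost,  # keep or substitute
            )

    # Walk back from the bottom-right corner to recover one optimal alignment.
    ops = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and src[i - 1] == tgt[j - 1] and dist[i][j] == dist[i - 1][j - 1]:
            ops.append(("keep", i - 1, j - 1))
            i, j = i - 1, j - 1
        elif i > 0 and j > 0 and dist[i][j] == dist[i - 1][j - 1] + 1:
            ops.append(("sub", i - 1, j - 1))
            i, j = i - 1, j - 1
        elif i > 0 and dist[i][j] == dist[i - 1][j] + 1:
            ops.append(("del", i - 1, j))
            i -= 1
        else:
            ops.append(("ins", i, j - 1))
            j -= 1
    ops.reverse()
    return ops


def candidate_rules(wrong: str, right: str, context: int = 1) -> List[Tuple[str, str]]:
    """Turn each contiguous run of edits into an (A, B) rule candidate.

    `context` (a hypothetical knob) is the number of neighbouring alignment
    steps included on each side of the edited span.
    """
    src, tgt = wrong.split(), right.split()
    ops = edit_ops(src, tgt)
    rules = []
    k = 0
    while k < len(ops):
        if ops[k][0] == "keep":
            k += 1
            continue
        start = k
        while k < len(ops) and ops[k][0] != "keep":
            k += 1
        window = ops[max(0, start - context):min(len(ops), k + context)]
        a = [src[i] for kind, i, _ in window if kind != "ins"]
        b = [tgt[j] for kind, _, j in window if kind != "del"]
        rules.append((" ".join(a), " ".join(b)))
    return rules


if __name__ == "__main__":
    for a, b in candidate_rules("He go to school yesterday",
                                "He went to school yesterday"):
        print(f"{a} => {b}")
```

Including a little context around the edited span is one plausible way to make a candidate more specific than a bare word swap. How much context to keep, and which candidates survive, is what the paper's corpus-based filtering and refinement would decide.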


Similar resources

Discovering Correction Rules for Auto Editing

This paper describes a framework that extracts effective correction rules from a sentence-aligned corpus and shows a practical application: auto-editing using the discovered rules. The framework exploits the methodology of finding the Levenshtein distance between sentences to identify the key parts of the rules and uses the editing corpus to filter, condense, and refine the rules. We have produ...


MT9V126 Data Sheet

Features • Low-power CMOS image sensor with integrated image flow processor (IFP) and video encoder • 1/4-inch optical format, VGA resolution (640H x 480V) • ±2.5% additional columns and rows to compensate for lens alignment tolerances • Integrated lens distortion correction • Overlay generator for dynamic bitmap overlay • Integrated video encoder for NTSC/PAL with overlay capability and 10-bit...


Critique of Manuscript-Correction/ The Role of Editors in Presenting the Author: A review of Toghray Mashhadi's biography in his newly published Book of Essays, Fatima Mehri

The Role of Editors in Presenting the Author: A Review of Toghray Mashhadi's Biography in His Newly Published Book of Essays. Fatemeh Mehri, Associate Professor of Persian Language and Literature, Shahid Beheshti University. Abstract: Researchers in the field of editing and correcting manuscripts consider the writing of introductions as part of the correction process. T...


Identifying the time of a step change in AR(1) auto-correlated simple linear profiles

Assuming a first-order auto-regressive model for the auto-correlation structure between observations, in this paper, a transformation method is first employed to eliminate the effect of auto-correlation. Then, a maximum likelihood estimator (MLE) of a step change in the parameters of the transformed model is derived and three separate EWMA control charts are used to monitor the parameters of th...


ProphetMT: A Tree-based SMT-driven Controlled Language Authoring/Post-Editing Tool

This paper presents ProphetMT, a tree-based SMT-driven Controlled Language (CL) authoring and post-editing tool. ProphetMT employs the source-side rules in a translation model and provides them as auto-suggestions to users. Accordingly, one might say that users are writing in a ‘Controlled Language’ that is ‘understood’ by the computer. ProphetMT also allows users to easily attach structural in...




Publication date: 2010